PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Csa17g008510.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 717 aa    MW: 78806.9 Da    pI: 5.6211
Description HD-ZIP family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Csa17g008510.1   genome  CSGP    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  61.5   1.2e-19  65     121  1          57
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57 
                     +++ +++t+ q++e+e++F+++++p+ ++r++L+++lgL+  qVk+WFqN+R+++k+
  Csa17g008510.1  65 KKRYHRHTQLQIQEMEAFFKECPHPDDKQRKQLSRELGLEPLQVKFWFQNKRTQMKN 121
                     688999************************************************995 PP

2    START     218.6  2.1e-68  252    465  2          206
                     HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECT CS
           START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetleviss 87 
                     la +a++el+++a++++ +W++++   + +e+ ++f+++ +     +++ea+r+++vv+m++++ ve+l+d++ qW++ +a    +a+tl+v+s+
  Csa17g008510.1 252 LAVAAMEELMRMAQVDDSLWKSLV--FDDEEYARTFPRGIGprpagFRSEASRETAVVIMNHVNIVEILMDVN-QWSTIFAgmvsRAMTLAVLST 343
                     6789********************..************999********************************.********************* PP

                     T......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SS CS
           START  88 g......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgr 175
                     g      galq+m+ae+q++splvp R+ +f+Ry++q+g+g+w++vd+S+ds q++p     +R++++ Sg+li++++ng+skvtwvehv++++r
  Csa17g008510.1 344 GvagnfnGALQVMTAEFQVPSPLVPtRETYFARYCKQQGDGSWAVVDISLDSLQPNPP----ARCRRRASGCLIQEMPNGYSKVTWVEHVEVDDR 434
                     *********************************************************8....********************************* PP

                     XXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 176 lphwllrslvksglaegaktwvatlqrqcek 206
                      +h+l++++v++g+a+gak+wva l+rqce+
  Csa17g008510.1 435 GVHSLYKHMVSTGHAFGAKRWVAILDRQCER 465
                     *****************************97 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60   1.5E-21    43     120  IPR009057    Homeodomain-like
SuperFamily      SSF46689           4.6E-19    53     122  IPR009057    Homeodomain-like
PROSITE profile  PS50071            16.406     62     122  IPR001356    Homeobox domain
SMART            SM00389            6.2E-18    64     126  IPR001356    Homeobox domain
Pfam             PF00046            3.2E-17    65     120  IPR001356    Homeobox domain
CDD              cd00086            7.86E-19   65     123  No hit       No description
PROSITE pattern  PS00027            0          97     120  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848            41.829     242    468  IPR002913    START domain
SuperFamily      SSF55961           1.28E-33   242    467  No hit       No description
CDD              cd08875            1.12E-121  246    464  No hit       No description
SMART            SM00234            2.6E-58    251    465  IPR002913    START domain
Pfam             PF01852            2.5E-60    252    465  IPR002913    START domain
Gene3D           G3DSA:3.30.530.20  3.0E-5     341    432  IPR023393    START-like domain
SuperFamily      SSF55961           7.61E-26   484    708  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 717 aa
MFEPNMLLAA MNNADSNNHN YNHEDNNNEG FLRDDEFDSA NTKSGSENQE GGSGNDQDPL  60
HPNQKKRYHR HTQLQIQEME AFFKECPHPD DKQRKQLSRE LGLEPLQVKF WFQNKRTQMK  120
NHHERHENSH LRTENDKLRS DNIKYREALA NASCPNCGGP TAIGEMSFDE HQLRLENARL  180
REEIDRISAI AAKYVGKPVS NYPLMSPPPL PPRPLELGMG NFGGEAYGNN PTDLLKSITT  240
PTEADKPVII DLAVAAMEEL MRMAQVDDSL WKSLVFDDEE YARTFPRGIG PRPAGFRSEA  300
SRETAVVIMN HVNIVEILMD VNQWSTIFAG MVSRAMTLAV LSTGVAGNFN GALQVMTAEF  360
QVPSPLVPTR ETYFARYCKQ QGDGSWAVVD ISLDSLQPNP PARCRRRASG CLIQEMPNGY  420
SKVTWVEHVE VDDRGVHSLY KHMVSTGHAF GAKRWVAILD RQCERLASVM ATNISSGEVG  480
VITNQEGRRS MLKLAERMVI SFCAGVSAST AHTWTTLSGT GAEDVRVMTR KSVDDPGRPP  540
GIVLSAATSF WIPVPPKRVF DFLRDENSRN EWDILSNGGV VQEMAHIANG RDTGNCVSLL  600
RSANSSQSNM LILQESCTDP TASFVIYAPV DIVAMNIVLN GGDPDYVALL PSGFAILPDG  660
NANGGGDGGS LLTVAFQILV DSVPTAKLSL GSVATVNNLI ACTIERIKAS MSCETA*
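The Start and End columns in the domain tables are 1-based, inclusive positions in this protein sequence. The following is a minimal sketch, not part of the PlantTFDB record, of how the numbered block can be joined into a plain string and a domain cut out of it. The function names `parse_numbered_block` and `domain_slice` are hypothetical helpers introduced only for illustration, and the sketch assumes the block uses single-letter residue codes with a trailing `*` as terminator.

```python
import re


def parse_numbered_block(block: str) -> str:
    """Join a numbered sequence block into one residue string.

    Assumption: anything that is not a letter (spaces, the running
    position numbers at line ends, the trailing '*') is layout only.
    """
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First three rows of the sequence block above (residues 1-180).
block = """
MFEPNMLLAA MNNADSNNHN YNHEDNNNEG FLRDDEFDSA NTKSGSENQE GGSGNDQDPL  60
HPNQKKRYHR HTQLQIQEME AFFKECPHPD DKQRKQLSRE LGLEPLQVKF WFQNKRTQMK  120
NHHERHENSH LRTENDKLRS DNIKYREALA NASCPNCGGP TAIGEMSFDE HQLRLENARL  180
"""

seq = parse_numbered_block(block)

# Homeobox coordinates taken from the Signature Domain table (65-121).
homeobox = domain_slice(seq, 65, 121)
print(len(seq))
print(homeobox)
```

The printed slice can be compared with the Csa17g008510.1 row of the Homeobox alignment in the Signature Domain section; the same call with 252 and 465 on the full sequence would give the START domain region.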
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AK316776  0.0      AK316776.1 Arabidopsis thaliana AT1G05230 mRNA, complete cds, clone: RAFL09-78-H10.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_010475208.1  0.0      PREDICTED: homeobox-leucine zipper protein HDG2-like isoform X2
Swissprot  Q94C37          0.0      HDG2_ARATH; Homeobox-leucine zipper protein HDG2
TrEMBL     B3H6Y4          0.0      B3H6Y4_ARATH; Homeobox-leucine zipper protein HDG2
STRING     AT1G05230.1     0.0      (Arabidopsis thaliana)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM49128149
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G05230.3  0.0      homeodomain GLABROUS 2